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ABSTRACT 

Clustering is an important research topic for wireless sensor 
networks (WSNs). A large variety of approaches has been 
presented focusing on different performance metrics. Even 
though all of them have many practical applications, an ex- 
tremely limited number of software implementations is avail- 
able to the research community. Furthermore, these very few 
techniques are implemented for specific WSN systems or are 
integrated in complex applications. Thus it is very difficult 
to comparatively study their performance and almost impos- 
sible to reuse them in future applications under a different 
scope. In this work we study a large body of well estab- 
lished algorithms. We identify their main building blocks 
and propose a component-based architecture for developing 
clustering algorithms that (a) promotes exchangeability of 
algorithms thus enabling the fast prototyping of new ap- 
proaches, (b) allows cross-layer implementations to realize 
complex applications, (c) offers a common platform to com- 
paratively study the performance of different approaches, 
(d) is hardware and OS independent. We implement 5 well 
known algorithms and discuss how to implement 11 more. 
We conduct an extended simulation study to demonstrate 
the faithfulness of our implementations when compared to 
the original implementations. Our simulations are at very 
large scale thus also demonstrating the scalability of the 
original algorithms beyond their original presentations. We 
also conduct experiments to assess their practicality in real 
WSNs. We demonstrate how the implemented clustering 
algorithms can be combined with routing and group key es- 
tablishment algorithms to construct WSN applications. Our 
study clearly demonstrates the applicability of our approach 
and the benefits it offers to both research & development 
communities. 

1. INTRODUCTION 

During the last decade, wireless sensor networks (WSNs) 
gained the interest of computer science, industries and academia, 
not only from theoretical but also from practical perspec- 
tives [26j. Consisting of spatially distributed autonomous 



sensor-equipped devices, WSNs allow the cooperative mon- 
itoring of physical or environmental conditions (e.g., tem- 
perature, light, pollutants, etc.), enabling a multitude of 
applications in both urban and rural contexts. 

Many of the proposed applications assume large node pop- 
ulations densly deployed over sizeable areas. Thus far, we 
have only a few examples of large-scale deployments of such 
systems in rural context, whereas in the case of urban de- 
ployments we are currently seeing great advances, as signi- 
fied by research projects such as City Sense ^16j and Smart- 
Santander [34] . 

Current wireless communication technologies used by the 
vast majority of off-the-shelf sensor nodes allow short range 
message exchanges. Thus the network is operated in multi- 
hop fashion to enable scalable routing, data aggregation, and 
querying. Designing and operating such large size networks 
requires scalable architectural and management strategies. 
Additionally, it is of high importance to design energy-aware 
algorithms for preserving the network lifetime. 

Since [s], grouping sensor nodes into clusters has been widely 
pursued by the research community in order to achieve net- 
work scalability, fault-tolerance and energy efficiency. In 
WSNs a large variety of approaches have been presented fo- 
cusing on different performance metrics. Some have been 
proposed as stand alone methods (see e.g., 23 ), others in- 
corporated as sub protocols in larger solutions designed to 
solve more specific problems such as query execution, aggre- 
gation, localization etc. (see e.g., [24[ [41] ). 

Unfortunately, even though all of them have many practical 
applications, an extremely limited number of software im- 
plementations for real sensor nodes is available to the com- 
munity 42, 24, 23 . Furthermore, these very few algorithms 
are implemented for specific WSN systems or are integrated 
in complex applications, therefore rendering them very diffi- 
cult to reuse by future application developers. For example, 
the authors of [24] implement the three cluster hierarchy 
maintenance algorithms presented in 24j [23j as part of a 
routing module for TinyOS v2.0. Similarly, the work in [42] 
implements the algorithm of [41j by extending the multi-hop 
routing module provided in TinyOS vl.O and by modifying 
the well-known Surge application in order to take advantage 
of the clustering scheme. Since these implementations are 
strongly coupled with the routing modules, using them for 
a different purpose (e.g., data aggregation, energy efficiency. 



localization etc.) requires significant development effort. 



schemes. 



Surprisingly, to the best of our knowledge, these are the 
only clustering schemes that are implemented in a contem- 
porary OS for WSNs. All other implementations of clus- 
tering schemes (e.g., the well known LEACH protocol |22] ) 
are in simulated environments (e.g., ns2) and the vast ma- 
jority are not publicly available and/or open source. This 
inevitably makes it very difficult to comparatively study the 
performance of existing approaches so that to select the best 
approach for a particular application scenario. Moreover, 
even converting these implementations into code for real de- 
vices (e.g., with 8-bit microprocessors and tiny amounts of 
RAM) is certainly a non trivial task. 

In this work, we start by examining a 90 recent clustering al- 
gorithms for WSNs in order to identify the key components 
that re-appear in the majority of the cases. We carefully 
identify two basic procedures (that of Cluster-head selection 
and Node grouping) that appear in almost every algorithm of 
the 90 studied. We then propose a component-based archi- 
tecture for developing clustering algorithms. We focus on 26 
well studied clustering algorithms and design a very limited 
set of base modules that is highly parameterizable and can 
be interchanged to implement all these algorithms or new 
ones. Essentially, we totally avoid implementing each clus- 
tering algorithm as a monolithic, stand-alone piece of code. 
The component-based approach allows us to promote ex- 
changeability of different modules that interact using well- 
defined interfaces. Modules can be exchanged with other 
implementations without affecting the remaining code. 

Out of the 90 algorithms, we select the 5 most characteristic 
and implement them under our architecture. These algo- 
rithms are carefully selected so that each of them contains 
as many unique design choices as possible. We conduct an 
extended simulation study to demonstrate the faithfulness 
of our implementations when compared to the original im- 
plementations. For all cases, the resulting implementations 
achieve almost identical performance to the one acquired by 
the original studies. We exploit the benefits of our archi- 
tecture and also conduct experiments in a hardware testbed 
in order to assess their performance in real conditions. The 
implementation effort required to develop these algorithms 
under our modular approach is significantly short; e.g., for 
the case of the well known LEACH [22] algorithm, only 2% 
of the final lines of code is unique, the rest are reused from 
other, common modules. 

Another benefit of our approach is that we provide a com- 
mon platform so that comparisons between algorithms is 
easily accessible. Developers can easily mix and match mod- 
ules in order to provide new clustering variants that best fit 
their application specifications. The ability to implement 
new algorithm variants with minimum effort is of significant 
importance for conducting experimental-driven research. 

Our approach also offers the ability to implement cross- 
layer algorithms. The modular design proposed allows to 
use the clustering algorithms as sub-protocols in other prob- 
lems such as energy conservation, routing, role assignment, 
security etc.. This reduces the application development ef- 
fort and also simplifies the implementation of more complex 



All implementations are done using Wiselib 8 : a code li- 
brary, that allows implementations to be OS-independent. It 
is implemented based on C++ and templates, but without 
virtual inheritance and exceptions. All implemented algo- 
rithms are platform independent as they can be compiled 
on a number of different hardware platforms (e.g., TelosB, 
iSense, Scatter Web) and OS independent as they can be 
automatically used in systems implemented using C (Con- 
tiki), C++ (iSense), and nesC (TinyOS). 

Finally, we conduct a thorough evaluation using both sim- 
ulated and experimental environments. For all cases, our 
simulations are at very large scale thus also demonstrat- 
ing the scalability of the original algorithms beyond their 
original presentations. The results of the evaluation also 
indicate that our implemented code achieves high scalabil- 
ity and efficiency. Moreover, for all cases, for the first 
time, we conduct experiments and assess their practicality 
in real WSN. This also demonstrates the capability of our 
approach to make code easily available to the community 
that are hardware and OS independent. 



2. CLUSTERING TECHNIQUES 

A large variety of clustering algorithms has been presented 
during the past years. In the relevant bibliography there ex- 
ist several surveys and tutorials (e.g. [l] [7| [25^ 14 ) that at- 
tempt to categorize and classify the various protocols based 
on the design choices and mode of operation. 

For example, in 1 they survey 54 algorithms and identify 
the relevant architectural parameters that play an impor- 
tant role in their design, such as the Network dynamics, 
In-network data processing and Node deployment and ca- 
pabilities. They classify the protocols based on their main 
objectives in the following categories: Load balancing, Fault- 
tolerance, Increased connectivity and reduced delay, Mini- 
mal cluster count. Maximal network longevity. Finally, they 
provide a taxonomy of the examined algorithms that takes 
into account the Cluster properties. Cluster-head capabili- 
ties and Clustering process. Similarly, in ^ they refer to 
44 clustering algorithms. They mention the most important 
parameters for the clustering procedure which are Cluster 
Count, Cluster Overlap, Cluster-head Selection, Node Mo- 
bility and Time Complexity. Then, they distinguish the 
clustering algorithms in two main categories: Probabilistic 
and Non-probabilistic, depending on the cluster formation 
criteria and parameters used for cluster-head selection. 

By carefully studying 8 surveys and tutorials ( l] HH [H 
19 To| [Tsl [20I) that present a total of 90 clustering algo- 
rithms, we identify the procedures of Cluster-head selection 
and Methodology of node grouping as part of almost every al- 
gorithm. This observation was of great value for our generic 
algorithm engineering work. Of course, the implementation 
of these procedures depends on the clustering algorithm it- 
self (e.g. probabilistic/deterministic). In the next subsec- 
tions, we concentrate on well established clustering algo- 
rithms that are examined in the surveys and we present the 
techniques they employ for the above procedures. 



2.1 Cluster-head Selection 

On Table [l] one can see the techniques used by popular 
clustering algorithms for the process of cluster-head selec- 
tion. In some algorithms, like 22, 5 each node is assigned a 
probability p of becoming a cluster-head. In algorithms like 
[32| |4|[4T], some deterministic criteria like node connectivity, 
node identity and energy are respectively used for electing 
cluster- heads. Finally, algorithms like [12| |18| are based 
on the combination of criteria in order to assign weights to 
nodes and decide the cluster-heads based on those. 
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Table 1: Cluster-head Selection Techniques of Well 
Established Clustering Algorithms 



If someone wishes to abstract the above techniques and 
group them based on their high-level behavior, two cate- 
gories may be formed: a probabilistic one and an attribute 
based one. The probabilistic approach selects cluster-heads 
based on some probability p whereas the attribute based 
approach accepts a parameter as input value (e.g., identity, 
energy, weight) and discovers the appropriate cluster-heads. 

2.2 Node Grouping 

When a node is elected as cluster- head, it advertises itself 
to neighboring nodes in order for them to join its cluster. A 
node can decide to join a cluster based on various criteria 
which can be seen on Table [5] In most algorithms, e.g., 22, 
43 6], the criterion for a node to join a cluster is the distance 
to the cluster-head. In other algorithms, like [39 33 T2j the 
nodes decide to join a cluster-head based on some cluster- 
head attribute like remaining energy, time to live, etc.. 
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Table 2: Node Grouping Techniques of Well Estab- 
lished Clustering Algorithms 



CH is the criterion for grouping, the BPS approach is suit- 
able whereas in algorithms that employ a criterion based 
on some joint computation (e.g. cluster's average energy or 
cluster's total weight), a sequential traversal of every node 
is required and thus the DPS approach is more appropriate. 
Moreover, the execution of each of these distributed traver- 
sal approaches is controlled by the desired hop-distance from 
the cluster-head. 

3. ARCHITECTURE 

The analysis of the related previous work reveals the vast 
number of clustering algorithms that have been proposed. 
It also reveals the extremely limited number of implemen- 
tations that exist. These implementations follow a mono- 
lithic approach: they are implemented as a single software 
module. In fact, all implementations integrate the resulting 
single piece of code with the routing protocol of the operat- 
ing system. Such a heavy coupling of the two code modules 
achieves good results in terms of efficient, hierarchical rout- 
ing. On the other hand, if someone wants to reuse the clus- 
tering code for a different purpose (e.g., data aggregation, 
energy saving, role assignment, security etc.) a large part of 
the code must be modified. Purthermore, if someone wishes 
to use a different clustering algorithm for routing, again he 
has to re-integrate it with the existing code. 

Another aspect of algorithm implementations for WSN is 
the fact that currently there are different hardware and 
software platforms available. The research community has 
not converged into a single software or hardware architec- 
ture. Therefore, it is of great importance to make sure that 
code can execute in different software and hardware plat- 
forms. Clearly, the monolithic implementations target spe- 
cific WSN architectures. To make them available for a dif- 
ferent platform we have to re-implement the code. 

Based on the above it is evident that we have to provide 
abstractions at two different levels: (i) at protocol level so 
that exchangeability of algorithms and cross- layer im- 
plementations can be achieved and (ii) at architecture 
level so that platform independence and OS indepen- 
dence are achieved. 

3.1 Wiselib: A Generic Algorithm Library for 
Heterogeneous Sensor Networks 

Our goal is to tackle these problems by following a component- 
based approach. A central decision to achieve this is to use 
the Wiselib |8] algorithm library. It is implemented based 
on C++ and templates, thus our code can be generic and 
parameterizable. The selection of particular target OS and 
hardware platform is done at compile time automatically 
and efficiently by the library. 



As can be seen on Table [2] various criteria can be used for 
joining a cluster. Thus, most algorithms require some kind 
of algorithmic method in order to spread the necessary in- 
formation into the network. Considering this fact, when 
trying to abstract the functionalities of existing algorithms 
regarding node grouping two approaches arise depending on 
the way the information is propagated in the network: the 
breadth first discovery (BFS) and the depth first discovery 
(DFS). For example, in algorithms where the distance to 



The Wiselib is an algorithm library for sensor networks that 
is completely written in C++, and uses templates in the 
same way as Boost and CGAL. Wiselib provides a generic 
interface to the OS, which simplifies the development process 
and decreases the need for dealing with low-level function- 
ality of specific hardware platforms. This makes it possible 
to write generic and platform independent code that is very 
efficiently compiled for various platforms, such as iSense or 
Contiki, or the sensor network simulator Shawn [28]. As an 
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nodes that have joined another cluster. Collected informa- 
tion is maintained in membership tables by the IT compo- 
nent. These tables are of crucial importance for the algo- 
rithms that will be executed on top of the clustering - they 
are necessary to ensure cross- layering. This component also 
monitors the node's neighborhood and updates the member- 
ship tables based on observed changes to the network. This 
information can be used from the other components. For 
example, changes in the neighborhood that indicate that a 
node has left a cluster can trigger a new join decision process 
from the JD component. 



example we refer to the send routine, which while exists in 
every sensor, it is implemented in a different way for almost 
every unique hardware. By providing Wiselib with the Ra- 
dio aspect of the hardware which contains the send routine, 
we can use it in our code. Thus, by making the Radio a 
template type with templated routines, we can pass differ- 
ent implementations to the library. Essentially, this allows 
us to truly implement our algorithms once and be able to 
execute them in most popular hardware platforms with just 
a simple recompilation. 

3.2 Basic Components 

The component-based design that we propose is depicted in 
Fig. [l] We partition the logic of clustering algorithms into 
three pieces with clear boundaries in terms of functionality 
provided. Each partition is designed so that it can progress 
its work in a relatively independent manner while ensuring 
that the correct functionality of the algorithm. Clean inter- 
faces are provided so that the partitions can easily commu- 
nicate, fast and without heavy information exchange. 

Cluster-head Decision (CHD). The first partition that 
we propose is related to the cluster-head selection process. 
All clustering algorithms have a mechanism for selecting 
cluster-heads. We wish to implement such mechanisms as 
a single, stand-alone, software component. Each particular 
implementation takes into account the specific design choices 
of the clustering algorithm to select which nodes will become 
cluster- heads. If the algorithm proposes periodic rotations 
of cluster- heads, the component can use call-backs to the 
Timer component so that it is re-executed periodically. If 
the algorithm is cluster-headless, the resulting implementa- 
tion will just be an "empty" implementation. 

Join Decision (JD). The second partition is related to the 
methodology by which nodes decide to join cluster-heads. 
This component constructs the necessary payloads for the 
JoinRequest/JoinDeny/ Join Accept messages, also it de- 
termines if a node will join a cluster when a JoinRequest 
message is received. The decision can be based on local 
criteria (e.g., energy levels, mobility status, etc.) and/or it 
can be related to information provided by the JoinRequest 
message (e.g., size of cluster etc.). In some other cases the 
decision may be already taken by the CHD component. 

Iterator (IT). The third partition is related to the organi- 
zation of the nodes while clustering decisions are made by 
each node. This component is responsible for categorizing 
and storing neighbors into nodes that have already joined 
the cluster, nodes that have not joined the cluster yet and 



The above three components are used by the main compo- 
nent which we call the Core Component (CC). It is the 
kernel of our architecture that controls and coordinates all 
other components so that clusters are properly formed and 
maintained. The CC provides a public interface for other 
algorithms to take advantage of the resulting network orga- 
nization. This public interface is an implementation of the 
Wiselib concept of Clustering and thus provides the cluster's 
ID, the ID of the cluster-head, and also allows to register 
a function callback in order to be able to deliver events to 
external components whenever an change to the cluster oc- 
curs, e.g., when the node joined a new cluster, or a neighbor 
from different cluster was discovered, or a new cluster was 
formed etc.. In the following we present the life-cycle of CC 
when forming a new cluster. 

1. CHD is invoked to determine if the node will become 
a cluster head or not. 

2. If the node is a cluster-head : JD is invoked to send 
JoinRequest messages to nearby nodes and invite 
them to the cluster. The JoinRequest messages can 
be sent to all available nodes using Broadcast messages 
or to selected nodes using the selected nodes ids. 

3. Upon receiving a Join Request message, CC isolates 
the message's payload and passes it to JD. 

If JD decides to join, a JoinAccept message is sent 
to the originator of the JoinRequest message, IT is 
notified of the address so for it to be saved as the node's 
Cluster-head. Note that if the protocol dictates that 
nodes may join a cluster even if they are at multi-hop 
distance from the cluster-head, then the IT may delay 
the transmission of the JoinAccept message so that 
other nodes are examined (e.g., this is done in the case 
of the depth-first search node traversal). 
If JD decides not to join, a JoinDeny message is gen- 
erated along with a payload from JD and passed to the 
originator of the JoinRequest message. 

4. If a JoinDeny message is received, its payload is passed 
to JD to be examined, in case the neighborhood's con- 
ditions are of interest and the IT is notified in order to 
keep track of which neighbors have joined the cluster 
and which have not. 

5. When all nodes have been examined the membership 
tables are generated by the IT and the process of clus- 
ter formation completes. 

3.3 Implementation Details 



In the following we present a Wiselib concept for each one of 
four basic components. The design goal of the concept is to 
cover many cases while staying generic. This follows from 
the requirement that each module must implement all the 
methods of a concept. 

Core Component (CC) Concept. The CC concept takes 
as template parameters a set of components types such as 
Radio, Timer and Debug that are needed for sending mes- 
sages, registering events and optionally printing debug mes- 
sages. The most important parameters are the types for the 
CHD, JD and IT which the CC will use for the clustering 
algorithm. The first method that initializes the module also 
provides instances of the components that the module will 
use. Then we have two methods for enabling and disabling 
the module, which is useful when it should only be run in 
certain points in time. After the module is enabled, the 
find_head() method is called and starts the cluster forma- 
tion. Next, we have a method for setting the parameters of 
the algorithm, which also sets the parameters for every other 
component. Then, we have a method for registering a call- 
back in order to get notifications upon events. Finally, CC 
provides a set of functions to access useful information such 
as the cluster id, the parent node(if any) etc. The concept 
is defined in Wiselib as follows: 

1 template<typename OsModel , 

2 typename Radio , 

3 typename Timer , 

4 typename Debug , 

5 typename HeadDecision , 

6 typename JoinDecision , 

7 typename Iterator> 

8 class CoreComponent { 

9 public : 

10 void init (Radio&,Timer&,Debug&,CHD&,JD&,IT&); 

11 void enable ( void ) ; 

12 void disable (void ) ; 

13 

14 void set_parameters ( parameters_t *); 

15 void f ind_head ( void ) ; 

16 

17 template<typename T, void (T : : * TMethod ) ( u i n t 8_t ) > 

18 int reg_changed_callback (T* obj ) ; 

19 

20 node_id_t parent () 

21 cluster_id_t cluster_id() 

22 bool i s _c 1 u s t e r _ h e a d ( void ) ; 

23 ... 

24 } ; 

Cluster Head Decision (CHD) Concept. In the CHD 

concept we have a method for setting the parameters (e.g., 
the probability value that the module will use). Addition- 
ally, there is the method for calculating if the current node is 
a cluster- head, and a method to get this result. The concept 
is defined in Wiselib as follows: 

1 template<typename Radio , typename Debug> 

2 class ClusterheadDecision { 

3 public : 

4 void init(Radio& , Debug& ); 

5 void enable ( void ) ; 

6 void d i s a b 1 e ( void ) ; 

7 

8 void set_parameters ( parameters_t *); 

9 bool is_cluster_head ( void ) ; 

10 bool calculate_head ( ) ; 

11 }; 

Join Decision (JD) Concept. In the JD concept we have 
a method that gives the hop count from the cluster-head, af- 



ter the node has joined a cluster. It also provides methods 
that set the payload for specific types of messages. The min- 
imum requirement is three methods, for the JoinRequest, 
the JoinAccept and the JoinDeny messages. Finally, we 
have the method join that is called with a new JoinRe- 
quest payload, and it calculates if it is going to join the 
cluster. The concept is defined in Wiselib as follows: 

1 template<typename Radio , typename Debug> 

2 class JoinDecision { 

3 public : 

4 void init(Radio& , Debug& ); 

5 void enable ( void ) ; 

6 void disable (void ) ; 

7 

8 int hops ( ) ; 

9 void ge t _j o i n_r e qu e s t _p ay lo ad ( b 1 oc k_d at a_t *); 

10 void get_j oin_accept_pay load ( block_data_t *); 

11 void get_j oin_deny_pay load ( block_dat a_t *); 

12 size_t get_payload_length ( int ) ; 

13 bool join(uint8_t *, uint8_t); 

14 }; 

Iterator (IT) Concept. For IT, we provide methods for 
getting the cluster id and the parent of the node. Moreover, 
the next_neighbor() method allows iterating through the 
neighborhood of the node. If the neighborhood information 
is not available, we can register a callback function that the 
Iterator will call to inform us about changes in the neigh- 
borhood. The concept is defined in Wiselib as follows: 

1 template<typename OsModel , 

2 typename Radio , 

3 typename Timer , 

4 typename Debug> 

5 class Iterator { 

6 public : ... 

7 void init(Radio&, Timer&, Debug&); 

8 void enable ( void ) ; 

9 void disable (void ) ; 

10 

11 cluster_id_t c 1 u s t e r _ i d ( void ) ; 

12 node_id_t parent ( void ) ; 

13 node_id_t next_neighbor ( ) ; 

14 

15 template<typename T, void (T ::* TMethod )( u i n t 8_t ) > 

16 int reg_next_callback (T* obj); 

17 

18 private : 

19 vector_t cluster_neighbors_ ; 

20 vector_t non_cluster_neighbors_ ; 

21 node_id_t parent. ; 

22 ... 

23 } ; 



3.4 Base Modules 

In this section, we revisit the clustering techniques as sum- 
marized in Sec. [2] in the light of the above components and 
structure. In the sequel we use the term component to refer 
to one of the basic building blocks of our architecture, as pre- 
sented above, and the term module to refer to an implemen- 
tation of a particular component (or in Wiselib terminology, 
a particular component concept). We implement a total of 
six modules with parameterizable functionality in order to 
cover a wide range of clustering approaches. Therefore, we 
call them the base modules since by combining them we can 
come up with almost all original clustering algorithms exam- 
ined. For example, many algorithms propose each node to 
decide whether to become a cluster-head locally with some 
probability p (e.g., 22 ). Other algorithms propose to do 
this in a deterministic way using specific attributes (e.g., E] 



using local ids, [36] based on node's connectivity, [41] us- 
ing node's remaining energy, or a weighted combination of 
such criteria as in 12 ). Some propose that this is repeated 
periodically every time period t (e.g., ^^). Some others 
propose to select cluster-heads up to k-hop distance. Table 
[3] summarizes the six base modules for each one of the four 
components. 

We understand that it is clearly impossible to come up with 
a small set of modules that is generic enough to implement 
every possible clustering algorithm. However, we do propose 
a small set of modules that can be easily modified and/or 
extended by programmers to speed up the implementation 
effort of a large range of algorithms. Therefore, we attempt 
to cover as many design flavors as possible while staying 
generic. In the following section we showcase how we used 
these modules to fully implement 5 well known algorithms 
and discuss how they can be further modified to implement 
11 others. We believe that this is a solid evidence that our 
approach can help future developers implement applications 
that rely on a specific clustering scheme. 
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probability p every t 
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nates the operations of 
the other components 



Table 3: Implemented Base Modules 



4. EXAMPLE IMPLEMENTATIONS 

We now proceed by examining five characteristic algorithms 
and how they are implemented under our scheme. Interest- 
ingly, in all five cases the CC_NoRM module remains un- 
changed. We note that the partition of the functionality in 
the four components allows us to find easily the places where 
we have to implement algorithm specific code. Then, the se- 
lection of one of the given base modules helps minimize the 
implementation. We identify 11 additional algorithms and 
explain in details how they can be implemented by pointing 
the places where the changes need to be made and sketching 
the actual implementations. All modules that will be de- 
scribed in the sequel are named under the rule name_role 
where name is the name of the algorithm and role can be 



either CHD, jd or IT. 

LCA. A classic algorithm that forms k-hop clusters |4]. Ev- 
ery node becomes a cluster-head with a probability p. All 
cluster-heads advertise themselves to every node within k 
hops and all simple nodes join the cluster of the closest 
cluster-head. The implementation of LCA is derived with- 
out any single change from the base modules PROB, BFS and 
NORM of Tab. [3] Algorithms like 39 36 can also be imple- 
mented in a straight forward way. This clearly demonstrates 
that application designer can come up with prototypes in a 
very short time frame. 

LEACH. A well studied algorithm that periodically rotates 
cluster-heads [22]. Every node i decides independently on 
predefined intervals to become a cluster-head with proba- 
bility Pi. The value of pi is based on the node's previous 
role and the desired number of clusters in the network (k). 
So every node becomes periodically a cluster-head and the 
cluster-head's extra workload is distributed evenly amongst 
every node. All cluster-heads advertise themselves as in LCA 
but only 1 hop away. Simple nodes decide to join the cluster- 
head that requires the minimum communication energy (us- 
ing the RSSI value) and inform the selected cluster-head of 
their decision. For the implementation of LEACH we mod- 
ify PROB so that p is calculated internally every time the 
cluster is reformed. Interestingly, for the JD module we use 
the implementation of LCA (i.e., lca_jd) with very minor 
modifications (about 8% of the total lines of code). Finally 
we use the base norm iterator module as is. Based on these 
modules we can then implement the algorithm of 41 and 
[29| that uses a SNR instead of RSSI for the join condition, 
while [5] uses the same CHD but for /c-hops. 

TCCA. In this algorithm, cluster-heads are selected based on 
the residual energy 33 . The cluster-heads are periodically 
selected based on a fixed probability and by taking into ac- 
count the available energy of the node. We use the base 
PROB module by modifying it to include this extra design 
choice. Then the JD module takes into consideration the 
residual energy of the cluster-head as well as the distance 
of the node to it. This modification is again very easy to 
implement by modifying the join criteria. 

MOCA. This algorithm forms A:-hop overlapping clusters [43] . 
Cluster-head decision is exactly the same as in the base mod- 
ule PROB. For the JD component we use BFS but allow a node 
to belongs to all clusters of up to k hops away in order to 
achieve overlapping clustering. To complete the implemen- 
tation we also need to make changes to the norm iterator 
component. Due to the fact that more than one clusters 
are selected the variables and data structures provided by 
the base module are not appropriate. Instead of using single 
variables, all normal nodes use a table to store the ids of the 
cluster-heads they belong and cluster-heads use a table to 
store the the ids of all adjacent clusters. This algorithm is 
clearly different from the previous ones. The proposed de- 
sign guides us to the places where changes need to be made. 
Once again the base modules are heavily reused. Even for 
the different iterator component, more than 35% of the origi- 
nal code is present in this algorithm. Interesting, the code of 
11 that also forms overlapping clusters is extremely similar 
to the code we just discussed. 



Many algorithms seem to use ideas similar to those presented 
above. Such algorithms are presented in [2l] 40 18] 35 that 
exchange only one attribute over one or more hops at regu- 
lar intervals or every time a special event is detected. Other 
algorithms like 12 exchange multiple attributes like mobil- 
ity, node degree, time as cluster-head and power consumed. 
All these algorithms can be incorporated in our library with 
extremely small development effort. 

MaxMinD. In |2 a very different algorithm is presented that 
forms clusters after examining all nodes at d — hop distance. 
Cluster-heads are elected after an id exchange phase of 2d 
rounds, using a heuristic function on all received ids. Un- 
like all previous algorithms the cluster join decision is made 
automatically and nodes join the cluster of their selected 
cluster- heads. Every simple node then informs its one hop 
neighbors of its decision and nodes connected to multiple 
clusters become gateways. All gateway nodes finally inform 
their cluster-head of their connections to other clusters dur- 
ing the converge cast phase. 

This algorithm represents a totally different cluster forma- 
tion approach where each node elects its cluster-head and 
joins the cluster at the same time. This requires very dif- 
ferent design on the CHD and JD modules. To accommodate 
the special design of MaxMinD we implement the message 
exchange using pay loads provided by the JD module. The 
cluster join condition is in fact trivial after the cluster that 
the node will join is decided. The iterator module then re- 
quires some extra payloads for the converge cast phase that 
MaxMinD employs. 

The unique design of MaxMinD gives us a good opportu- 
nity to make a first attempt to evaluate the component- 
based design that we propose. In particular we repeat the 
implementation by following a stand-alone monolithic ap- 
proach. So now we can compare the two versions based on 
their lines of code and the resulting binary size. Indeed, 
code size is not the best code metric available. Component- 
based implementations usually require more lines of code 
since they include class definitions and require additional 
functions declarations so that information of private mem- 
bers is exchanged between modules. Still, it can give us 
a rough estimate on the effort needed. So, keeping these 
points in mind, the monolithic version consists of 1200 lines 
of code whereas the component-based version is just over 
2000 lines long. However, in the component-based version 
almost 1000 lines were reused from the existing base mod- 
ules. In terms of code size, the component-based executable 
is 13738 bytes long while the monolithic is 11951. This met- 
ric reflects better the growth of the code. It indicates that 
for the particular algorithm the price (in terms of code size) 
for carrying out a well organized and modular implementa- 
tion requires an increase of 14%. Clearly, this extra code size 
is the result of a more readable code which is therefore eas- 
ier to debug, maintain and extend. We strongly believe that 
MaxMinD is a characteristic example of how even irregular 
algorithms (in terms of design) can be incorporated into our 
component-based design with less effort than implementing 
it in a stand-alone monolithic way. 

We carry out the same comparison with the other algorithms 
by also implementing them as monolithic modules. Interest- 



ingly, we can see similar difference in code size between each 
version. In some sense this is expected since all component- 
based implementations use the same interfaces and thus in- 
corporate similar set of functions with a fixed code size. 

We also note that as the library grows in size, the effort 
to develop new algorithms will reduce as more modules will 
become available. However, the real benefit of having a large 
set of implemented modules is the ability to experimental- 
drive the design of new clustering algorithms. Specific design 
choices that work well in particular network types can be 
easily transfered in new algorithms that also combine other 
modules with different specifications and goals. Inevitably, 
the availability of a large set of modules will help us develop 
sophisticated systems with reduced implementation effort. 



Algorithm 


Module 


Parent 


LOC 


Diff 


(%) 




PROB 


PROB 


149 





0% 


LCA 


LCA_JD 


BFS 


204 


16 


8% 




NORM 


NORM 


318 
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MOCA 


MOCA_JD 


BFS 
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69 


43% 




MOCA_IT 


NORM 


390 
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MAXMIND_CHD 


ATTR 
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MaxMinD 


MAXMIND_JD 


BFS 


237 


162 


68% 




MAXMIND_IT 


NORM 


528 


399 


75% 



Table 4: Implemented Algorithms, inherited Base 
Modules (see Tab. Isl) and modified lines of code 



5. VALIDITY OF IMPLEMENTATION 

In this section we wish to evaluate the faithfulness of the 
implementation of the clustering algorithms under our ap- 
proach. In order to do this we comparatively study the 
results of the performance evaluation of the protocols un- 
der our approach and the one that their designers did. The 
comparative study is conducted in a simulated environment 
created using Shawn |28j, a simulation framework that fo- 
cuses on an abstract, repeat able and expressive approach 
to WSN simulation. By replacing low-level effects with ab- 
stract and exchangeable models, the simulation can be used 
for huge networks in reasonable time while keeping the focus 
on the actual research problem. It provides many options 
such as packet loss, radius of communication, ways of com- 
municating and even mobility in an abstract way, without 
needing to provide specific code for every hange. 

Although it is very difficult to reproduce the exact topologies 
for the experiments, we try to generate topologies with as 
similar characteristics as the ones used in the original pub- 
lications. Also some minor differences are to be expected 
since we do not use the same simulation environment. The 
results of this comparative study is very positive as the val- 
ues acquired by our performance analysis are very close to 
the ones acquired from the original studies. Our values seem 
to confirm a valid behavior of the modular implementations 
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Figure 2: Comparison of Original Implementation 
of MaxMinD and Our Modular Implementation re- 
garding # of Cluster Heads and Avg. Cluster Size 

of the protocols. 

For MaxMinD, as we can see in [2], the original experiments 
create networks with up to 600 nodes in a 200 x 200 units 
region and communication range of 20 units. The parame- 
ter d was set to 2 and the main metrics extracted are the 
number of cluster-heads generated and the size of the clus- 
ters formed. Fig. [2] depicts the results from our performance 
analysis (marked as Modular) in comparison to the results 
from the original analysis of conducted in [2] . As we can see 
from the simulations we performed, the clusters formed have 
almost identical size to the originals while the cluster-head 
count is very similar. The minor discrepancy observed in 
the number of cluster-heads for small topologies is related 
to the distribution of the node ids in the network. In Max- 
MinD, the position of each node id is crucial to the number 
of cluster-heads generated as the heuristic that is used is 
based on the maximum ids transmitted during each round 
and sequentially the way the ids are placed. As we do not 
know the exact position of all node ids used in the original 
experiments it is difficult to achieve the same result. In our 
experiments we used a uniform random distribution. 

For Moca, the simulations performed in 43 focus on spe- 
cial features of overlapping clustering algorithms. Metrics 
like the number of covered nodes during the initial phase of 
the algorithm, cluster overlapping degree and orphan clus- 
ters is extracted. The size of the clusters formed is also 
measured. Again, we try to reproduce the same simulation 
environment. We create a world with 400 nodes and a node 
density of 9 neighbors. We simulate cluster formation with 
cluster-head probability of up to 50% and cluster diameters 
of 2, 3 and 4 hops. As we can see in Fig.|3]the percentage of 
the covered nodes achieved by our modular implementation 
is almost identical with that achieved by the original imple- 
mentation. Moreover, in order to evaluate the size of the 
clusters formed by Moca, we need to generate topologies of 
400 nodes and density of up to 21 neighbors. We use a fixed 
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Figure 3: Comparison of Original Implementation of 
Moca and Our Modular Implementation regarding 
Percentage of Covered Nodes and Avg. Cluster Size 



cluster-head probability of 15% and increase the diameter of 
the clusters formed. Once again the results gathered from 
our evaluation, as shown in Fig. [3] indicate clearly that the 
performance of our modular implementation is almost iden- 
tical to that the one achieved by the implementation of the 
algorithm's designers. 

For the other three algorithms, the simulation study con- 
ducted in [4) [22| [33] is very limited and/or is focused on 
energy related aspects that we do not investigate in this 
work. Still, even for this limited set of results (e.g., num- 
ber of elected cluster-heads for a specific topology of 100 
nodes), the corresponding results of the evaluation of our 
implementations are almost identical. Furthermore, the per- 
foramnce evaluation conducted in the following sections pro- 
duce results that are justifiable by the qualitative analysis 
conducted in the original publications of all five algorithms. 

6. PERFORMANCE EVALUATION 

In this section we evaluate the performance of the base mod- 
ules based on large-scale simulations so that we can assess 
their scalability and experiments using hardware to assess 
their effectiveness in real environments. 

The simulator experiments allow us to evaluate the scalabil- 
ity of the base modules in various network topologies. We 
work with two different types of network topologies. The 
first set of topologies have fixed network diameter: as we 
increase the number of nodes the network density increases. 
The second set of topologies have fixed node density: as the 
number of nodes increases so does the network diameter. In 
the second set, we generate networks with a density of 8 (i.e., 
on average each node is connected with 8 other nodes) and 
with density of 15. The number of nodes for each topology 
with fixed density varies form 10 to 10000 and for the second 
case with diameters of 1 to 100 hops. The fixed diameter 
topologies start from 10 nodes with 4 average neighbors and 
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Figure 4: Performance of chd modules for different 
network sizes of fixed density 



reach 1000 nodes with 256 neighbors and a network diameter 
of 5 hops. For higher densities the equivalent graphs become 
fully connected and thus they are useless for our evaluation. 
On each experiment we focus on the amount of time each 
module requires to complete; time its counted in simulation 
rounds. We also measure the total number of messages sent. 
Finally, we evaluate the average size of the clusters formed. 
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Figure 5: Performance of jd modules for diff*erent 
network sizes of fixed density or fixed diameter 



Through out all the simulations for the cluster-head phase 
we use the ATTR to elect cluster-heads with minimum ids 
in 1 and 3 hops, prob and maxmind_chd. For the clus- 
ter formation we use bfs, dfs, maxmind_jd and moca_jd. 
Finally, we evaluated norm, moca_it and maxmind_it. 

We start by assessing the performance of the cluster-head 
decision modules as depicted in Fig. ^ Note that similar 
results hold for the networks of fixed diameter. For space 
limitations we only include the results from the fixed density 
networks. As expected, all modules require constant number 
of rounds to complete. The total time required is in fact de- 
termined by the specific parameters. Similarly, the number 
of messages exchanged is linear to the /c-hop parameter of 
the modules. Clearly, the PROB module achieves the best 
scalability since the decision is local and requires a constant 
number message exchanges. 

The performance of the join decision modules is depicted in 
Fig. |5] Again, the time efficiency of all modules is directly 
affected by the /c-hop parameter (or d-hop parameter for 
MaxMinD). Similar results hold for the number of messages 
exchanged. As we can see, MaxMinD takes almost no time 
to complete and requires no messages. dfs_jd needs more 
time than any other module as all available nodes have to 
be invited in a depth first manner and more interaction be- 
tween the nodes take place. When using the fixed diameter 
topologies we can see that the increased node density has a 
great effect on the operation of dfs_jd as clusters grow in 
size and more nodes have to be invited. 



Finally, we evaluate the size of the clusters formed using 
different combinations of modules. Fig. |6] depicts the results 
for both types of topologies. As expected, in the fixed node 
density topologies all clustering schemes produce clusters 
with the same average cluster size. In the fixed diameter 
topologies, as the network diameter is small some algorithms 
like MaxMinD form a single cluster. As the network diameter 
increases, the number of clusters produced increases linearly. 

In order to understand the performance of the modules in 
real hardware we continue our evaluation by conducting ex- 
periments in a local testbed comprised of up to 10 iSense 
W nodes placed on a 1 hop network. These experiments 
allow us to measure real time and not simulated rounds. 
For each experiment, we assess the time required by each 
module to complete and the total number of messages ex- 
changed. Compiling our application and using it with the 
iSense sensors required minimal modifications as everything 
was developed under the Wiselib library that fully supports 
the iSense platform. 

In general, the results of all experiments were consistent with 
the results gathered during the simulator experiments. Of 
course, the data of the experiments is clearly in a smaller 
scale. Moreover, note that some minor differences in oper- 
ation time were observed as perfect synchronization of the 
nodes and fully reliable communication channels are impos- 
sible to achieve in real environments. Due to space limita- 
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tions we only include Fig. |7| that depicts the results for the 
JD modules. Both bfs and dfs perform as expected and 
the second one needs more time to invite all 10 nodes in the 
cluster. Clearly, the information gathering approach up to 
d-hops distance of MaxMinD hinders the overall performance 
of the system. Still, all three modules complete in very short 
time (< 20sec) and short message exchanges (< 80). 

Apart from assessing the performance of the implemented 
modules, this study also demonstrates the benefit of our 
approach. With truly minimum effort (we just modify a 
parameter in the constructor of some modules and then re- 
compile) we can examine the performance of a wide range 
of algorithms. Additionally, by mixing and matching base 
modules we can come up with new algorithms and easily 
compare their performance with previous versions. Essen- 
tially, this creates an ideal platform for comparing different 
ideas and design choices under a common framework as we 
can instantly test our ideas in simulated or real environ- 
ments and easily fine tune the performance of the developed 
applications. 

7. APPLICATIONS 

In the previous sections we discussed about the benefits of 
our approach in terms of reducing the implementation effort 
for developing clustering algorithms. We also show cased 
how to easily evaluate the performance of protocol variants 
or new protocols in a simulated and experimental frame- 
work. In this section, we discuss a third aspect of our ap- 
proach that is of paramount importance to the development 
of WSN applications. 

Developing clustering algorithms by itself is of absolutely no 
use. The true purpose of organizing the network in clusters 
is to improve the scalability of the network when perform- 
ing other operations such as data aggregation, routing, en- 
ergy conservation etc.. Currently implemented algorithms 
are loosely coupled with routing protocols thus rendering 
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them useless for improving the performance of other tasks. 
Our approach is totally different. The proposed architecture 
allows to include clustering algorithms as sub-components of 
other algorithms that deal with totally different problems. 
Interestingly, the clustering algorithms are tightly coupled 
with the higher-layer algorithm and in this way they can be 
easily exchanged with other ones. Essentially, this allows 
us to split the development and evaluation of our system in 
two different phases. We can first develop our system using 
one of the existing clustering algorithms and then during 
the evaluation, as demonstrated in the previous section, fine 
tune the clustering algorithm to maximize the performance 
of the higher level system. 

7.1 Hierarchical routing 

Clustering can improve the scalability of routing. After clus- 
ters are set-up, routing can be performed either in intra- 
cluster level or in inter-cluster level. In the first case, a 
node that needs to communicate with another one in the 
same cluster establishes a route to it (directly or through 
the cluster- head). In the second case, a node who wishes to 
reach another one that belongs to a different cluster, firstly 
communicates with its cluster-head who is now responsible 
to construct a route towards the destination's node cluster- 
head and then reach the final node through intra-cluster 
routing. Thus, one could observe that cluster-heads form an 
upper level of hierarchy that facilitates the routing process. 

In order to address cluster-level routing in our component 
based approach, we propose the ClusterRadio component. 
This component consists of two mechanisms named Intr- 
aClusterRadio and InterClusterRadio that are built 
based on the information provided by the it component. 
IntraClusterRadio stores local information about nodes 
that belong to a certain cluster and it is able to set-up routes 
(if necessary) inside this cluster (e.g. route to the CH). The 
InterClusterRadio component is responsible for discov- 
ering a cluster's gateway nodes - nodes that are close and 



thus able to communicate with neighboring clusters. This 
way, when a node wants to reach a node at a different cluster, 
the cluster's gateway nodes forward the request to neighbor- 
ing clusters searching for the desired cluster. After the route 
to the destined cluster has been setup the destination node 
can be reached through intra-cluster routing. 

The result of the above component is that the application 
developer can easily replace the flat routing algorithm with 
a cluster-based routing. Furthermore, the routing choice 
for inside the cluster and outside the cluster can be easily 
changed due to the component-based architecture. 

7.2 Group Key Establishment 

Another application of clustering is in the field of securing 
communication exchanges within the network by combin- 
ing them with Group key establishment (GKE) algorithms. 
GKE is the procedure of setting up secret cryptographic 
keys between groups of nodes. This essentially guarantees 
to some extend the confidentiality and integrity of the infor- 
mation exchanged. A wide variety of GKE protocols based 
on asymmetric and symmetric cryptographic techniques has 
been proposed so far, e.g. |27| |30l |17]. 



The big challenges when designing a GKE protocol for sensor 
networks are scalability and efficiency [l3]. Although, the 
proposed protocols aim at these goals, a GKE mechanism 
could execute faster and more efficiently when applied af- 
ter a clustering algorithm has divided the network into clus- 
ters. Moreover, certain GKE protocols like [44] are based on 
the assumption that the network is already organized into 
groups. Thus, one can realize that clustering techniques 
can improve the efficiency of computationally heavy proto- 
cols like GKE protocols and respectively the overall network 
performance. 

As an application example, we implement the GKE algo- 
rithm proposed in ^13j. It is based on Elliptic Curve Cryp- 
tograhy (EGG) and it employs a depth first traversal that 
visits all the network participants who contribute to the 
group key. To do this we register two callback methods at 
the CC so that the common key is re-computed when each 
node joins the cluster and the key is finalized with the last 
node addition. The keys are generated using the EGG opera- 
tions (point multiplication, encryption/decryption) provided 
by Wiselib. Then we use the prob, dfs and norm com- 
ponents to fix the operation of the cluster algorithm. The 
overall implementation requires a total of 11 lines of code. 
We evaluate the performance of the resulting implementa- 
tion in our iSense hardware. For the topology of 10 nodes 
it requires approximately 7 minutes to compute a common 
key of 163 bits. Remark that EGG-based cryptography of 
163-bit is equivalent to 1024-bit RSA keys. 

In a very similar way, protocols like [44 30 can be imple- 
mented by employing the necessary cryptographic mecha- 
nism and by using the information provided by the it com- 
ponent after the cluster formation has finished. 

8. CONCLUSIONS 

In this paper we study a large body of 90 clustering algo- 
rithms as presented in 8 recent surveys and tutorials. We 
focus on 23 algorithms that are (i) widely studied by the rel- 



evant bibliography, (ii) appeared very recently in very rele- 
vant competitive conferences and/or (iii) are implemented in 
a well known WSN platform. We identify their main build- 
ing blocks and present a very limited set of base modules 
that are parameterizable and can be interchanged to imple- 
ment existing algorithms or new ones under study. This 
process is of great value for a generic algorithm engineering 
approach. 

We carefully select 5 out of the 23 algorithms and explain in 
details how we combine and parametrize the 6 base modules 
to implement them. We conduct an extended simulation 
study to demonstrate the faithfulness of our implementa- 
tions when compared to the original implementations. For 
all cases, our simulations are at very large scale thus also 
demonstrating the scalability of the original algorithms be- 
yond their original presentations. Moreover, we conduct ex- 
periments and assess their practicality in real WSN. This 
also demonstrates the capability of our approach to make 
code easily available to the community that are hardware 
and OS independent. 

Our modular architecture, the implementation of the al- 
gorithms using multiple components and the Wiselib en- 
vironment provides a common platform so that compar- 
isons between algorithms is easily accessible. Developers 
can easily mix and match modules in order to provide new 
clustering variants that best fit their application specifica- 
tions. The ability to implement new algorithm variants with 
minimum effort is of significant importance for conducting 
experimental-driven research. 

We propose a component-based architecture for developing 
clustering algorithms that promotes exchangeability of al- 
gorithms and cross-layer implementations. We examine two 
important problems of hierarchical routing and group key 
establishment and show how they can be implemented by 
exploiting the proposed clustering architecture. We demon- 
strate how our approach makes the integration of algorithms 
more feasible. For the case of group-key establishment we 
also conduct experiments to assess the overall performance 
of the resulting system in a real environment. 

Our study clearly demonstrates the applicability of our ap- 
proach and the benefits it offers to both research & develop- 
ment communities. Our code is open-source and is publicly 
available for download and use. Due to the blind-review 
process we omit the url of our code. 
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